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Abstract 

We study multivariate integration of functions that are invariant under the permutation (of 
a subset) of their arguments. Recently, in Nuyens, Suryanarayana, and Weimar [10] (Adv. 
Comput. Math. (2016), 42(l):55-84), the authors derived an upper estimate for the nth min¬ 
imal worst case error for such problems, and showed that under certain conditions this upper 
bound only weakly depends on the dimension. We extend these results by proposing two 
(semi-) explicit construction schemes. We develop a component-by-component algorithm to 
find the generating vector for a shifted rank-1 lattice rule that obtains a rate of convergence 
arbitrarily close to 0(n~ a ), where a > 1/2 denotes the smoothness of our function space 
and n is the number of cubature nodes. Further, we develop a semi-constructive algorithm 
that builds on point sets which can be used to approximate the integrands of interest with 
a small error; the cubature error is then bounded by the error of approximation. Here 
the same rate of convergence is achieved while the dependence of the error bounds on the 
dimension d is significantly improved. 

Keywords: Numerical integration, Quadrature, Cubature, Quasi-Monte Carlo methods, 
Rank-1 lattice rules, Component-by-component construction. 

Subject Classification: 65D30, 65D32, 65C05, 65Y20, 68Q25, 68W40. 


1 Introduction 

In recent times, the efficient calculation of multidimensional integrals has become more and more 
important, especially when working with a very high number of dimensions. It is well-known that 
such problems are numerically feasible only if certain additional assumptions on the integrands of 
interest are imposed. In this paper we seek to construct cubature rules to approximate integrals 
of d-variate functions which are invariant under the permutation of (subsets of) their arguments. 
Such a setting was studied earlier in [16] in the context of approximations to general linear 
operators and also recently in [10, 17] for the special case of integration. These investigations 
were motivated by recent research by Yserentant [19], who proved that the rate of convergence 
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of the approximations to the electronic Schrodinger equation of iV-electron systems does not 
depend on the number of electrons N, thus showing that it is independent of the number of 
variables associated with the numerical problem. This is due to the fact that there is some 
inherent symmetry in the exact solutions (called electronic wave functions) to these equations. 
In fact, such functions are antisymmetric w.r.t. the exchange of electrons having the same spin, 
as described by Pauli’s exclusion principle which is well-known in theoretical physics. In [16], 
general linear problems defined on spaces equipped with this type of (anti-) symmetry constraints 
were shown to be (strongly) polynomially tractable (under certain conditions). That is, the 
complexity of solving such problems depends at most weakly on the dimension. Inspired by 
these results, it was shown in [10] that under moderate assumptions, the nth minimal worst case 
error for integration in the permutation-invariant (hence symmetric) setting of certain periodic 
function classes can also be bounded nicely. In particular, the existence of quasi-Monte Carlo 
(QMC) rules that can attain the rate of convergence 0(n -1 / 2 ) (known from Monte Carlo methods) 
was established. If there are enough variables that are equivalent with respect to permutations, 
then the implied constant in this bound grows only polynomially with the dimension d. Hence, 
the integration problem was shown to be polynomially tractable. However, the proofs given in 
[10] are based on non-constructive averaging arguments. 

In this paper, we extend the results of [10] and provide two (semi-) explicit construction 
schemes. First, we present a procedure which is inspired by the well-known component-by- 
component (CBC) search algorithm for lattice rules, introduced by Sloan and Reztsov [14]. 
More precisely, we prove that the generating vector of a randomly shifted rank-1 lattice rule 
which achieves an order of convergence arbitrarily close to 0(n~ a ) can be found component-by- 
component. Here a > 1/2 denotes the smoothness of our permutation-invariant integrands and 
n is the number of cubature nodes from a d-dimensional integration lattice. Originally, the CBC 
algorithm was developed with the aim of reducing the search space for the generating vector, i.e., 
to make the search quicker. Later on it was shown by Kuo [4] that this algorithm can also be 
used to find lattice rules that achieve optimal rates of convergence in weighted function spaces 
of Korobov and Sobolev type, and this improved on the Monte Carlo rate 0(n -1 / 2 ) shown pre¬ 
viously in [5, 13] and [12]. For an overview of these schemes for different problem settings, we 
refer to [2, 9]. 

Next, we present a completely different approach towards cubature rules that does not rely 
on the concept of lattice points. Based on the knowledge about sampling points that are suitable 
for approximation, we iteratively derive a cubature rule which attains the desired order of con¬ 
vergence for solving the integration problem in more general reproducing kernel Hilbert spaces 
(RKHSs). The basic idea of this type of algorithms can be traced back to Wasilkowski [15]. The 
proof relies on the fact that in RKHSs the worst case error for integration can be controlled by 
an average case error for the function approximation problem. This way we arrive at a cubature 
rule which achieves a rate of convergence arbitrarily close to the optimal rate of 0(n~ a ), and 
prove that under certain conditions, we can achieve (strong) polynomial tractability. 

Our material is organized as follows. In Section 2 we briefly recall the problem setting dis¬ 
cussed in [10]. In particular, we define the permutation-invariant subspaces of the periodic 
RKHSs under consideration. Section 3 introduces the worst case error of the problem and 
Proposition 3.1 outlines the main tractability results derived in [10]. Section 4 and Section 5 
contain our main results, namely the new CBC construction and the application of the second 
cubature rule explained to permutation invariant subspaces, respectively. In Section 4 we start 
with a brief description of (shifted) rank-1 lattice rules and recall some related results obtained in 
[10]. Then, Theorem 4.5 in Section 4.2 contains our new CBC construction. Section 5 is devoted 
to the alternate approach. We begin with a short overview of basic facts related to approximation 
problems w.r.t. the average case setting in Section 5.1. Afterwards, in Section 5.2, our quadrature 
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rule for general RKHSs is derived. The corresponding error analysis can be found in Theorem 5.8. 
Finally, in Section 5.3, the obtained results are applied to the permutation-invariant setting de¬ 
scribed in Section 2. The paper is concluded with an appendix which contains the proofs of some 
technical lemmas needed in our derivation. 

We briefly introduce some notation that will be used throughout the paper. Vectors are 
denoted by bold face letters, e.g., x,k,h. In contrast, scalars are denoted by, e.g., x,k,h. To 
denote multidimensional spectral indices we mainly use h or k. The elements of a d-dimensional 
vector are denoted as k = (k\,..., kd )■ We use x ■ y to denote the inner product or the dot 
product of two vectors x and y. The prefix # is used for the cardinality of a set. Finally, the 
norm of a function / in a function space Hd is denoted by \\f \ Hd\\. 


2 Setting 

We study multivariate integration 


Int d f := / f(x) dx 

J[ o,i] d 

for functions from subsets of some Hilbert space of periodic functions 
F d {r a ,p) := \ f : [0, l] d C 


( 1 ) 


/ £ L 2 ([0, l] d ) and ||/ | F d {r a43 )|| 2 := ^ | /(fc) r 0ij3 (fc) < oo 


Here /(fc) are the Fourier coefficients of /, given by 


f( k ) : = (/,exp(27rifc • -))l 2 = / / (x) exp(—27ri k ■ x)dx, 

J[o,i] d 

and r a d 3 : —> (0, 00 ) is a d-fold tensor product involving some generating function R: [1, 00 ) —> 

(0,oo) and a tuple (3 = (/3o,/3i) of positive parameters such that 

d 

r a A k ) = n (<W< • Po 1 + (1 - 80 , kl ) ■ PA ■ R(\h \) 2a ), fee z d . 


Note that the problem is said to be well scaled if /3o = 1. The parameter a > 0 describes the 
smoothness. Throughout this paper we assume that 

Rin, Tfi ) 

— R(m) < - < R(m) for all n, m e N, and some cr > 1. 

cr n 

Moreover, we assume that ( R(m)~ 1 ) m e i 2 a ; i.e., 


: = 


m— 1 


R(m) 


2 a 


< 00 . 


The functions / € F ( j(r a p) possess an absolutely convergent Fourier expansion. Also, the above 
assumptions imply R(m) ~ m and a > 1/2, and for these conditions, it is known that Fd is a 
d-fold tensor product of some univariate reproducing kernel Hilbert space (RKHS) F± = H(Ki) 
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with kernel K i, see, e.g., [ 6 , Appendix A]. The inner product on the RKHS F,j = H(Kd) is then 
given by 

( f,9 ) = r ^A k )K k )9( k )- 

fee i d 


We now recall the definition of Id-permutation-invariant functions f £ Fd(r ad 3 ), where Id C 
{1,... , d} is some fixed subset of coordinates. As discussed in [16, 17] and [10] these functions 
satisfy the constraint that they are invariant under all permutations of the variables with indices 
in I d ; i.e., 

f(x) = f{P(x)) for all x £ [0, l] d and all P £ Sd, (2) 

where 


Sd «5{i i ... t d}{Id) j-P ; {!)••■) d} —> { 1 ,..., d} 


P a bijection such that P|^ d}\i d 



(Note that this set always contains at least the identity permutation.) For example, if Id = 
{ 1 , 2 }, then Sd = {( 1 , 2 ,..., d) i-*- ( 1 , 2 ,..., d), ( 1 , 2 ,..., d) ( 2 , 1 ,..., d )} and ( 2 ) reduces to 
the condition f(x 1 , £ 2 , • • •, Xd ) = f(x 2 , £ 1 , • • •, Xd)- With slight abuse of notation, we shall use 
P also in the functional notation; i.e., we let 


P(x) := (x P (i),... ,x P(d) ) , x= (x!,... : x d ) £ [0,l] d . 


The subspaces of all /^-permutation-invariant functions in Fd will be denoted by &i d (Fd(r a ^p)). 
For fully permutation-invariant functions, we will simply write &(Fd(r a ^p)). It is known that if 
Id = ■ • • ,i#i d }, the set of all 




’ a,/3 


(fc) 


ffSd ■ M d (k)\ 


exp(27riP(fc) • x) 

P&S d 


with k from the set 

Vd := V { 1 .... td }{Id) ■= = (h, ...,kd)&% d k h <k i2 < ■■■ < k i#ld | 


constitutes an orthonormal basis of &i d {Fd{r a ^p))\ see [16] for more details. Here, 


Md(fc)! := M {li ..., d} (fc, I d )\ ■■= #{P £ S d \ P(k ) = k}, 

accounts for the repetitions in the multiindex k. Since the subspace &i d {Fd(r at p)) is equipped 
with the same norm as the entire space Fd{r a ^p ), it is again a RKHS. Moreover, from [10] we 
know that its reproducing kernel is given by 

r ~\ (h) 

Kd,i d (x,y) = ex P( 27ri/x • ( p ( x ) - V)), x,y£[ 0, l] d . (3) 

he zd rr d p&Sd 


Finally, we mention that (using a suitable rearrangement of coordinates) the space &i d (Fd(r a , 0 )) 
can be seen as the tensor product of the fully permutation-invariant subset of the ffld~v ariate 
space with the entire (d — ffld)~v ariate space; i.e., 


<5i d (Fd(r a ,p)) = &{F#i d {ra,p)) ® F d -#i d {r a ,p). 


Hence, the reproducing kernel also factorizes like 


K d ,i d = % dl {i.® K d -#i d - 

For more details about the setting, we refer the reader to [10]. 
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3 Worst case error and tractability 

We approximate the integral (1) by some cubature rule 

1 n —1 

Qd,n(f) '■= Qd,n(f;t {0 \ ... ,t (n ~ l \w 0l ... := , d,n£ N, (4) 

n j =o 

that samples / at some given points v-^ £ [0, l] d , j = 0 ,... ,n — 1, where the weights Wj are 
well-chosen real numbers. If vjq = • • • = w n -i = 1, then Qd, n is a quasi-Monte Carlo (QMC) 
rule which we will denote by QMC d n := QMC d n ( ■; t(°\ ..., 

If K is the 2d-variate, real-valued reproducing kernel of some (separable) RKHS H d of func¬ 
tions on [ 0 , l] d , the worst case error of Q dt n is given by 


e wor (Q d; „; H d ) 2 := 


SUp |Int d / - Qd,n{f) | 
yf&H d ,Wf | H d \\<l 


n— 1 


'[ 0 ,l] d ^[ 0 ,l] d 

n—1 


I\(x, y) &x dy - 

n ^ 


Wj 


K 


3=0 


'[ 0 ,l] d 


^ X, t^'j 


dx 


1 


n ^ 


Y. WjW k K{t^\t (k) ^j , 


(5) 


t,fc=o 


see, e.g., Hickernell and Wozniakowski [3]. The nth minimal worst case error for integration on 
H d is then given by 


e(n, d; H d ) := inf e wor (Q d , n ; H d ). 

Qd,n 

Here the infimum is taken with respect to some class of cubature rules Q dt n which use at most 
n samples of the input function. 

We briefly recall the concepts of tractability that will be used in this paper, as described 
in Novak and Wozniakowski [ 6 ]. Let n = n{e,d) denote the information complexity w.r.t. the 
normalized error criteion, i.e., the minimal number of function values necessary to reduce the 
initial error e(0, d; H d ) by a factor of £ > 0, in the d-variate case. Then a problem is said to be 
polynomially tractable if n(e, d) is upper bounded by some polynomial in £ -1 and d, i.e., if there 
exist constants C, p > 0 and q > 0 such that for all d G N and every e £ (0,1) 

n(e,d) < C d q e~ p . ( 6 ) 


If this bound is independent of d, i.e., if we can take q = 0, then the problem is said to be strongly 
polynomially tractable. Problems are called polynomially intractable if ( 6 ) does not hold for any 
such choice of C,p, and q. For the sake of completeness, we mention that a problem is said to be 
weakly tractable if its information complexity does not grow exponentially with and d, i.e., if 


lim 

£— 1 -\-d—>oo 


lnn(e, d) 
£ —1 + d 


= 0 . 


In [10, Theorem 3.6] the following tractability result has been shown for the spaces H d = 
&i d {F d (r a ^)). 
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( 7 ) 


Proposition 3.1. For d >2, let Id C {1,..., d} with ffl d > 2 and assume 


2 Pi 

fto R(m) 2a 


< 1 


for all m € N. 


Consider the integration problem on the Id-permutation-invariant subspace &i d (F d (r a iJ g)) of 
Fd{r a ,p)- Then 

• for all n and d £ N, the nth minimal worst case error is bounded by 


e(n,d;&i d (F d (r a ,p))) < e(0, d; 6 Id (F d (r a ^))) JV 


1 — r/* 




where the constants V* and rj* are chosen such that 


:=V*(V*)'-= E 


2 pi 


m=V* + l 


Po R(m) 2 


< 1 . 


( 8 ) 


(9) 


• there exists a QMC rule which achieves this bound. 

• if d— ffl d € 0(ln<i), then the integration problem is polynomially tractable (with respect to 
the worst case setting and the normalized error criterion). 

• if d — f/I d £ 0(1) and (9) holds for V* = 0, then we obtain strong polynomial tractability. 

It is observed in [10] that for the periodic unanchored Sobolev space, i.e., Pq = /3i = 1 
and R(m) = 2nm 7 the assumption (7) is fulfilled if a > 1/2. In addition, for sufficiently 
many permutation-invariance conditions and sufficiently large a, we even have strong polyno¬ 
mial tractability. Note that for a near 1/2, the factor (1 — f?*) -1 / 2 is extremely large, whereas for 
a = 1 we already have (1 — ry*) -1 / 2 < 1.05. We stress the importance of the above tractability 
result by mentioning that the integration problem on the full space (I d = 0) is not even weakly 
tractable. That is, in this case the information complexity n(e, d) grows at least exponentially 
in £ -1 + d. We refer to [10] for a more detailed discussion about this result. 


4 Component-by-component construction of rank-1 lattice 
rules 

4.1 Definitions and known results 

We briefly introduce unshifted and shifted rank-1 lattice rules. For n € N, an n-point d- 
dimensional rank-1 lattice rule Q d , n (z) is a QMC rule (i.e., an operator as given in (4) with wq = 

• • • = w n - 1 = 1) which is fully determined by its generating vector ztZj := (0,1,..., n — l} d . 
It uses points t/T from an integration lattice L(z : n) induced by 2 : 

t(j) .— / Ff_ 1 £4 m0( ] 1 f 01 - j = 0, 1 ,..., n — 1 . 

(h) n 
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We will restrict ourselves to prime numbers n for simplicity. The following character property 
over K w.r.t. the trigonometric basis is useful: 


1 

n 


n— 1 

Y exp(27ri (h • z)j/n) 
j =o 


1 if h ■ z = 0 (mod n), 
0 otherwise. 


( 10 ) 


The collection of h £ Z d for which this sum is one is called the dual lattice and we denote it by 
L(z,n) ± . It has been shown in [10, Corollary 4.7] that irrespective of the set Id, for standard 
choices of r ad g (such as in the periodic Sobolev space or Korobov space), the class of unshifted 
lattice rules Q d ,n{z) is too small to obtain strong polynomial tractability. We thus consider 
randomly shifted rank-1 lattice rules in what follows. 

For n £ N, an n-point d-dimensional shifted rank-1 lattice rule consists of an unshifted rule 
Qd,n(z) shifted by some shift A £ [0, l) d ; i.e., it’s points are given by 


:= + A | ^ — + A^j mod 1 for j = 0, 1,..., n — 1. 

We will denote such a cubature rule by Q d ,n{z) + A. 

The root mean squared worst case error helps in establishing the existence of good shifts. It 
is given by 

E{Q d , n {z)) := ( f e WOT (Q d , n {z) + A; &i d {F d (r a ^))) 2 dA 
\J[ o.i) d 

The above error can be calculated using the shift-invariant kernel 



Kf% v ( x ,y) := / K d j d ({x + A}, {y + A}) dA, x, y £ [0, l] d , 

J[o,i) d 

associated with K d ,i d given in (3). From [10, Proposition 4.8] we know that for d £ N and 
/jC {1,..., d} the shift-invariant kernel is given by 

r~ l (k) 

K tr7( x ’ y)= Y tfg Y exp ( 2?ri p ( fc ) ■ (* - y)) > [o, i] d , 

fceVd ^ d PaSd 

and, for every unshifted rank-1 lattice rule Q d ,n{z) the mean squared worst case error satisfies 

E(Qd,n(z)) 2 = e™ r (Q dl „(z); fl$7) 2 (11) 

r _1 (k) 

= Y X! 1 P{k)£L( z ,n)± > c maxjd - #J d , 1} n~ 2a , 

o/fceVd ^ d Pes d 

where c = 2f3iiin(a)/fio does not depend on d and n , see [10, Theorem 4.9]. Here Hf 1 ™ denotes 
the RKHS with kernel Kf 1 ™ and L(z, n )- L is the dual lattice induced hy z £ Moreover, the 
following existence result has been derived; cf. [10, Theorems 4.9 and 4.11]. 

Proposition 4.1. Let d £ N, I d C {1,... ,d}, and n £ N with n > c R be prime. Then there 
exists a shifted rank-1 lattice rule Q d ,n(z*) + A* for integration of Id-permutation-invariant 
functions in F d (r a i/ g) such that 

e wor (O d ,n(^*) + A* ; 6 Id (F d (r a ,p))) 2 < E(Q d , n (z*)) 2 

< (l+c R ) x C d ^(r a ,p)\ for all 1 < X < 2a (12) 
n A 
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with cr as defined in Section 2 and 


Cd,\(. r »,p) ■= I y 

\o^h& d 

Remark 4.2. Some remarks are in order: 

(i) In particular, for A = 1, the above proposition implies that there exists a shifted rank-1 
lattice rule such that 

E(Q d , n (z*)) < V2^C d ,i(r a ,p) 112 n~ 1 ' 2 (14) 

and it can be seen (cf. [10, Corollary 4.14]) that this bound differs from (8) only by a small 
factor which does not depend on d, provided that (7) is fulfilled. 

(ii) If 1 < A < 2a, then (independent of Id) there is an exponential dependence of Cd,\(r a ^p) on 

the dimension d which gets stronger as A increases; this growth in the associated constant 
is typical and can be observed also when dealing with lattice rule constructions for classical 
spaces (without permutation-invariance). For A —> 2a, we even have C d ,\(r a ,p) —> oo. 
Nevertheless, without going into details, we mention that the dependence of this constant on 
the dimension can be controlled using additional constraints on the parameters (3 = (f3o, /3i). 
We stress that these additional constraints are rather mild and would not achieve a similar 
tractability behavior for the full space. For an extensive discussion of lower and upper 
bounds for Cd,\(r a ,p) we refer to [10, Proposition 4.12]. □ 


M d (h)\ ! 


#S d 


a,P 


(h) 


(13) 


4.2 The component-by-component construction 

Here we derive a component-by-component (CBC) construction to search for a generating vector 
z* £ Z() such that (for some well-chosen shift A* £ [0, l) d ) the corresponding shifted rank-1 
lattice rule Qd,n(z*) + A* satisfies an error bound similar to the one given in Proposition 4.1. 
Our approach is motivated by similar constructions that exist for standard spaces defined via 
decay conditions of Fourier coefficients; see, e.g., [4, 13, 14]. 

We will further make use of the following notation. Let d £ N and Id C {l,...,d}. For 
subsets 0 ^ u C {1,..., d} and vectors h = (hi,..., h d ) £ we denote by h u := (hj)j &u the 

restriction of h to u. Hence, h u £ Z u is a #u-dimensional integer vector indexed by coordinates 
in u. Given h u £ Z u , its trivial extension to the index set {1,..., d} is denoted by (h u \ 0) £ Z d ; 
i.e., 

( h u ;0) u :=h u and (fa u ;0)£:=0 for all I £ {1,..., d} \ u. 

Finally, the restriction of the set of admissible permutations and their associated multiplicities 
(see Section 2 for the original definitions) to the subset u will be abbreviated by 


S u j d := S u (I d Hu) := jP: u —> u P a bijection such that P| u \( /dnu ) = idj 


and 

M u Jd ( h u y. := M u (h u , I d D u)! := # {P £ S uJd \ P(h u ) = h u } , h u £ Z u , 
respectively. 

To derive the component-by-component construction, we will need a little preparation. First, 
let us recall a technical assertion which can be found in [10, Lemma 4.10]. 






Lemma 4.3. Let d £ N, h £ if, and n £ N be prime. Then 



^ ^ ^-hGL(z,n)- L 


n —1 d n —1 


^eii^e exp(27ri jh e ze/r 


n z —' n 

j =0 £—1 zg —0 


1 if h = 0 (mod n), 

— otherwise , 
n 


where L(z,n) ± denotes the dual lattice induced by z and h = 0 (mod n) is a shorthand for ht = 0 
(mod n) for all £ = 1,..., d. 

Proof. The proof is based on the character property given in (10). We refer to [10] for the 
complete proof. ■ 


Furthermore, we will use a dimension-wise decomposition of the mean squared worst case 
error (11), as given below. 


Proposition 4.4. Let d £ N, Id C {l,...,d}, and n £ N be prime. Then for all generating 
vectors z = (z i,.. •, z d ) £ Uf the mean squared worst case error of Q d ,n(z) + A w.r.t. all shifts 
A £ [0, l) d is given by 

d 

E(Qd,n(z)) 2 ■= Po E £ • • •, */), (15) 

£=1 

where we set 


B Id ,t(zi,...,zt) ■■= 


E - 

uC{l, 

eeu 


: ul d E 

/iu6(Z\{0}) u 


,I d 


(K) 




Id 


r -l 

' a,/3 


(K) 1 


h u £L(z u ,n ) 


(16) 


for £=l,...,d and c uJd = P* U {#* d nu)) ■ 

Proof. Let i/uC {1,..., d} and assume that h = ( hi ,..., hd) £ if \ {0} with hj 0 if and 
only if j £ u and hj = 0 otherwise. Then, clearly, 

r a^( h ) = r a^( h u) and, for all 2 € Z^, h ■ z = h u ■ z u ; 

i.e., h £ L(z,n) ± if and only if h u £ L(z u ,n ) ± . Moreover, for all such h = (h u ; 0) we have 

M d (h)l _ M {1 . d} ((fe u ;0), J d )! _ #(J d \u)!M u , 7 ,(/i u )! _ / #/„ y 1 M uJd (h u )\ 

#s d #i d \ #i d \ V#(^n u)J #S uJd ’ 

since #S u j d = #(/ d fl u)!. Further, note that the collection of all non-empty u C {1,... ,d} 
can be written as the disjoint union U 7= i {uC{l,...,l}|<£u}. This gives rise to the disjoint 
decomposition 


d 

Z d \{0}= U {(fc u ;0)|h u e(z\{0}) u }= |J U {(h u -, 0) | h u 6 (Z \ {0}) u } . 

0^uC{l,...,d} <=luC{l,...,<), 

i&u 


Next, observe that for every function G: 1 d —> C the following holds: 


E 

fceVd 


m®i Z G < p «> = Z 

u V ' Pc.?, 
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Using this and the permutation-invariance of Md(-)! and r a ^f), we can express the mean squared 
worst case error (11) as 


i r —i (p(k)) 

E(Q d , n {z)f = -r~^{ 0) + £ —— £ M d (P(fc))!^-—l P(fc)ei(z , n)J 

fc£V d A h p&s d ^ d 

= -*■«»+E 


h£Z d 


From the above considerations we thus infer that 

-U(Qd,n(2)) 2 = ^ ^(fo) lfceL( Z ,n)^ 

he z<<\{ 0 } ^ d 


= E E E 

^u£(Z\{ 

jd-#u ( 4(^d 


e=i uc{i,...,q, h u e(z\{o})“ 

ieu 


Md((fa u ; o))! nu . nnil 

r cn,/3y\™ a ’ ^1) ^-(h u \0)eL(z,n) ± 


= E E 

t=i uc{i r ..,q, 

ieu 


\ 1 -i (h \ ^ 

\ _ll( T miW / v //c 'q^v^uJ ^uGL^u.n) 1 ) 

V#(/ d nu)y hug( ^ {o})u #5 UiJd 


which proves the result. ■ 

Now we are well-prepared to state and prove the main theorem of this section. It presents a 
component-by-component construction to search for a generating vector z* £ such that (for 
some well-chosen shift A* £ [0, l) d ) the error of the shifted rank-1 lattice rule Qd,n(z*) + A* 
achieves a rate of convergence which is arbitrarily close to 0(n~ a ). 

Theorem 4.5 (CBC construction). Let d £ N, I d C {1 ,..., d}, and assume n £ N with n> cr 
to be prime. Moreover, let zf £ {1,..., n — 1} be arbitrarily fixed and select z%, ..., z d £ Z ra 
component-by-component via minimizing the quantities Bi d ^(zi, ..., z^_ lt zf), defined in (16), 
subject to zc £ Z„, (. = 2,..., d. Then there exists A* £ [0, l) d such that the shifted rank-1 
lattice rule Q d , n = Q d , n {z j(,..., z*f) + A* for integration of I d -permutation-invariant functions 
in F d (r ad 3) satisfies 

e WOT {Qd,n\&i d {F d {r ai p)))‘ 1 < (1 + c R ) x C d ,x(r a ^)ma.x{l,ffI d }^ (17) 

for all 1 < A < 2a, where C dt \(r a ^p) is given by (13). 

Before presenting the proof of Theorem 4.5, we stress that our bound (17) for the CBC 
construction is only slightly worse than the error bound found in the general existence result of 
Proposition 4.1. It always depends on the number of permutation invariant variables and thus 
cannot be used to deduce strong polynomial tractability. This seems to be unavoidable, see also 
Remark 4.6 below. However, note that this additional linear dependence on d is a noticeable 
overhead only for the case of A = 1; for A > 1, the exponential growth in overshadows 

this dependence on d. 

Proof of Theorem j.5. Our proof is based on the dimension-wise decomposition of the mean 
squared worst case error given in Proposition 4.4. Once we have found 2* = (z*,..., z*f) £ Z d 
such that the corresponding quantity E(Q d ^ n (z*)) 2 is upper bounded by the right-hand side of 
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(17), the result follows (as usual) by the mean value theorem which ensures the existence of a 
shift A* with 

e wor (Q d ,n(z*) + A* ; 6 Id (F d (r a ,p ))) 2 < E(Q d<n (z*)) 2 . (18) 

Step 1. Let A £ [1,2a), d £ N, I d C {1 ,... , d}, and n be a fixed prime. We apply Jensen’s 
inequality (see Lemma A.l) with p = l>l/A = gto the expression in (15) and obtain 


E(Q d , n (z)) 2/x < $ /X J2 Bl ^ z i’ • • • > **) 1A for a11 


(=i 


z £ Z 


d 

n’t 


with Bj d j as defined in (16). In Step 2 and 3 below we will show that if we select z = z* 
component-by-component, then for all £ = 1 ,..., d 1 the summands in the estimate are bounded 

by 


B Id ^{zl,...,z}) x / x 

< (1 + c R ) max{ 1, #/ d } 1/A - V 

n z —* 


(19) 


r 1/x 

’'Ujd 


E 


uc{i,..,4, 

ieu 


WA)! 

*S uJd 


,\i/A 


^u£(Z\{0}) u 

The recombination of these building blocks as in Proposition 4.4 then yields 


E(Q d ,n(z*)) 2/X <( 1 + Cfl)( ]T 

,hez<*\{o} 


M d (h)\ 

#S d ra 'P 


(h) 


1 l/A' 


max{l,#/ d } 1/A -. 


such that the claim (17) is implied by (18). 

Step 2. To prove (19) we apply Jensen’s inequality (again for p = 1 > 1/A = q) to (16) and 
obtain 


B Id ,i{z x ,...,zi) llx < 

uC{l,...,£} 

ieu 

( 20 ) 

for all zeZj and l = 1,..., d. 

Now consider 1 = 1 and assume z\ = z{ £ {1, 2,..., n — 1} to be fixed arbitrarily. Then the 
sum over all subsets in the latter bound reduces to u = {1}. Accordingly, we have S u ,i d = {id} 
and M u j d (h u )\ = 1 does not depend on h u . Moreover, for £ = 1 only those non-trivial indices 
h u belong to L(z u ,n ) J ~, i.e., satisfy h u ■ z u = 0 (mod n), which can be written as h u = h\ = nk 
for some k £ Z\ {0} because z u = z\ £ Z n and n is assumed to be prime. Hence, in this case 
we obtain that r a ^ A (/i u ) equals 


c~ 1/A 

C U,Id 


E 

h. u e(z\{o}) u 


Ai J d 


(KV- 


#S. 


u,Id 


L/A 


r 1/x 

a,P 


(h u ) i 


h u £L(z u ,n ) J 


<*,/3 


1 0 X (n k) = p\ /X R(n |fc|)“ 2a/A < (d\ ,X 


— R(\k\) 

CR 


— 2 ot/X 


CR 

n 


2a/X 


’S(k)< CJ rr~J X (k) 


O', (3 


( 21 ) 


with cr/u < (1 + cr) max{l,^t/ d } 1 / A /n (recall that A < 2a and n > cr). 


Consequently, we 
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have 


Bi d ,ite) 1 /x <C{HS d E 


hi=n k, 

kez\{o} 


< (1 + C R ) max{l,#/ d } 1/A i c uj d X E 

fiu€(Z\{0})“ 


uC{l}. 

lGu 




#5u, 


Id 


a,/3 


(fru), 


where A: was relabeled as h u . In other words, (19) holds true for 1 = 1. 

Step 3. Now, assume that we have already determined z*, ..., z\_ x for some £ £ {2,..., d}. 
Then the best choice z\ £ Z ra , i.e., the minimizer of Bi d ^(z x , ..., ^|_ 1; •), satisfies 

Bi d Az* 1 ,...,z* i _ 1 ,zt) 1/X 


< r&r E B Id Az* 1 ,...,z* e _ 1 ,ze) 1/X 

^ n z £ ez n 


< E 


c" 1/A 

C u,/ d 


E 


teu 


h.„e(z\{o})“ 


Hu,/ d (fcu)! 
#SuJ d 


1/A 


p-V A 

a,/3 


(/iu 




E i 


h. D -jz*+/i£ 2:£=0 (mod n) 


Zi&n 


where we used (20) and employed the notation 0 := u \ {/} and z* := as well as the 

shorthands h„ := (h u \ 0) D and hi := (h u )i. 

Next, we estimate the term in the brackets for every h u £ (Z \ {0}) u with u C 1} 

which contains £. The character property (10) yields 


#z n ^ 


^ n —1 ^ n—1 

+h t zi= 0 (mod n) = _ E _ E eX P( 27ri j( h « ' Z l + h lZl)/n) 

*e=0 i=0 

^ n—1 ^ n —1 

= - E ex P( 27ri • zl)/n ) - E exp(27ri jhizi/n) 


3 =0 
n—1 


2^=0 


<-E 

n ' 


j=0 , 


exp(27ri j(h D • z*)/n) 


E ex P( 27ri jhizi/n) 


Z£=0 


<1 


— (modn)^O 




where 


^ n—1 ^ n —1 


W„(M := - E _ E ex P( 27ri jhezi/n) = 


n *—' n 

3=0 ze—0 


1 if hi = 0 (mod n), 
1/n otherwise, 


due to Lemma 4.3 for d = 1. 

To exploit this estimate, given u = 0U{£} as above, we split up the sum over all h u = ( h v ,hi ) 
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from (Z \ {0}) u = (Z \ {0})° x (Z \ {0}) and obtain 


E 

R u e(z\{o})“ 


#Su,I d 


|\!/A 


r~^\K)W n (h e ) 


E 


E 


a 0 e(z\{o})“ h t e( z\{o}) 

hi= 0 (mod n) 




1 

H— 
n 


E 


R u e(z\{o})“ 

(mod n) 




#<V, 

(K)W 1/X 


*S uJd 


r 1,x 

OL,P 


(h u ). 


In the first term every he equals n k for some 0 ^ k £ Z. Thus, we can perform a change of 
variables similar to what was done in Step 2 and replace the inner summation by a sum over all 
k £ Z \ {0}. Then from the product structure of r a ^(-) and the bound (21) it follows that 


r a]3 X (( h *’ h t)) =r E /A ^°) r E /A ( nfc ) - V ^ r a, 1 /3 /A ((^,fc)). 

But now we also need to estimate M U) / d ((fa 0 ,/i^))l w.r.t. this transformation of variables. If 
^ Id, then replacing he = nk by k does not effect M u y d . On the other hand, if i £ Id, then, at 
least we know that 


-VA/ 


-!/A 


CR -1/A/ 


-1/-V 


CR -1/A, 


M Ui j d ((/i 0 ,n£:))! < max{l, #(J d 0 u)} • M uJd ((h 0 ,k))\ < max{l ,#I d } ■ M uJd ((h 0 ,k))\. 
Hence, dropping the condition he ^ 0 (mod n) in the second term finally yields 
I uMK)l\ 1/x 


E 


*s UJd 


r~T(K)W n (he) 


h u£(Z\{0}) u 

< max{l ,#I d } 1/x 


cr 

n 


E 

h. u e(z\{o}) u 


E E 

R»e(z\{o})» fee(z\{o}) 
MujMl\ 1/X —i/a 




#5u, 


Id 


#Su, 


Id 


< (1 + c R ) max{l ,#I d } 


l/A _ 


E 


a,/3 (^«) 

u,/ d (fcu)!\ lA -1/A 


^u6(Z\{0})“ 


*s UJd 


a,13 


(hu) 


for every subset u C which contains I. This immediately implies the desired bound (19) 

on Bi dj e(z *,..., z|) 1/,A and the proof is complete. ■ 

Remark 4.6. Let us add some final remarks on the CBC construction: 

(i) Note that this theorem asserts the existence of a z* that achieves the desired bound and 
gives a constructive way of finding it (by successively minimizing the quantities H/ d ^(zi,..., ze) 
subject to ze). 

(ii) As usual, all non-trivial choices for the first component z* of the generating vector z* £ Z(( 
are equivalent. 
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(iii) For the remaining components we actually assumed more than we needed: Step 3 in 
the above proof shows that instead of minimizing Bi d ^(z \,..., z%_ 1: zi) for zi G Z n , 
£ G {2, . ..,d}, it would be sufficient to select z* t which performs better than the aver¬ 
age (w.r.t. A), i.e., which fulfills 


B Ii ^z* 1 ,...,zU, 4 ) 1/X < 


#z, 


■ Y, B Id , t {zl,...,zi_ 1 ,zt) 1/x . (22) 


Moreover note that from Holder’s inequality it easily follows that the latter estimate yields 
a corresponding bound for all A < A. That is, selecting £ = 2,..., d, according to (22) 
implies (17) with A replaced by any A G [1, A], whereas minimizing Bi d n produces a lattice 
rule which satisfies the bound (17) for the whole range [1,2a) of A. 

(iv) Observe that (in general) a lattice rule based on a generating vector 2* constructed as above 
is unfortunately not extensible in the dimension. The reason is that with increasing d the 
factors c u j d in the definition of Bj d j (see (16)) will change such that the contributions of 
the subsets u C {1, ...,£} with £ G u are different for every dimension. Hence, for fixed £ 
the optimal choice z\ may change with varying d unless the set of permutation-invariant 
coordinates Id is independent of the dimension (which is the uninteresting case). Thus, 
to successfully apply the CBC construction, we need to know the target dimension in 
advance. Nevertheless, compared to a full search in the complexity of finding a suitable 
generating vector is significantly reduced if we do it component-by-component and parts 
of the calculations made for dimension d may be reused while processing the (d + l)’th 
dimension. 


(v) As mentioned earlier, our conjecture is that the dependence of (17) on the dimension d 

is inevitable and cannot be improved by alternate steps in our proof. The CBC construc¬ 
tion tries to distinguish between otherwise identical dimensions; so no CBC construction 
of lattice rules for the integration of permutation-invariant functions can satisfy bounds 
like (14), in our opinion. Indeed, (16) in Proposition 4.4 above already indicates the com¬ 
plicated influence of already selected components |’_ 1 on the error contribution 

induced by the choice of the current coordinate Zn (and forthcoming indicee). The fact 
that the problem is caused by the permutation-invariance assumption (M u> j d is not of 
product structure!) is nicely reflected by the factor max{l,#/<*} in (17) which disappears 
for standard constructions, i.e., for spaces without symmetry constraints. 

(vi) Finally, let us mention that slightly stronger assumptions allow to get rid of the maximum 
term in (17). Indeed, a careful inspection of the proof shows that actually (17) can be 
improved to 


e WOI (Q d , n ;&iAF d (r a ^))) 2 < 


cr\ 

n J 


max{l,#/ d } 1/A ( 
rriaxjl, c£ /X 


2a/X l 


1 A 




I A 


? 2a/A—1 


+ 1 


Cd,\{r a ,p) —r- 


Thus, if we assume that n > cr max{l, #I ( z} 1// *' 2a A \ then we exactly recover the error 
bound (12) from Proposition 4.1. □ 
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5 An alternative construction 


In this section we consider a completely different approach towards efficient cubature rules for the 
integration problem defined in Section 2. For this purpose, in Section 5.1 we collect some basic 
facts (taken from the monographs of Novak and Wozniakowski [6, 7, 8]) about L 2 -approximation 
problems on more general reproducing kernel Hilbert spaces (RKHSs) H d = H{K) of d -variate 
functions and on somewhat larger Banach spaces Bd D Hd- Then, in Section 5.2, we show 
that some knowledge about the quantities related to these approximation problems allows us to 
(semi-explicitly) construct a sequence of cubature rules Qd,N for integration on Hd with worst 
case errors that decay with the desired rate. Finally, this construction is applied to our original 
integration problem of permutation-invariant functions in Section 5.3. 

5.1 The .^-approximation problem for RKHSs 

Let K denote the reproducing kernel of some Hilbert space Hd = H(K) of real (or complex) 
valued functions / on [0, l] d . The inner product on this space will be denoted by (•,•) and 
If | Hd ,|| = 1 / (•, •) is the induced norm. Then from the reproducing kernel property it follows that 

\f(.x)\ = \{f,K(-,x))\ <\\f\H d \\ y/K{x^x) for all x £ [0, l] d , 

see, e.g., Aronszajn [1] where a comprehensive discussion of RKHSs can be found. Hence, if 

M 24 '■= M 2 , d {K) := ( K(x,x)dx (23) 

is finite, then the space Hd can be continuously embedded into the space L 2 ([0, l] d ) and it holds 
||/|L 2 || 2 < M 2 ,d\\f \ H d \\ 2 for every / <E H d . In other words, M 2 >d (K)H 2 serves as an upper 
bound for the operator norm 

IIAppJ := sup \\f\L 2 \\ of App d : H d L 2 ([0,l] d ), / App d /:=/. 

feH d ,\\f\H d \\<i 

If, in addition, the embedding Hd L 2 ([0,l] d ) is compact, then we can try to approximate 
App d by non-adaptive algorithms 

n 

Ad,nf:=^ L i(f) a i (24) 

i= 1 

which are linear combinations 1 of at most n information operations L\,... ,L n from some class 
A and arbitrary functions ai,... ,a n £ L 2 ([0, l] d ). In the worst case setting, the error of such an 
algorithm is defined by 

e wor (A d:n ; Appd) := sup II App d / - A d ,„f | L 2 || . 

feH d ,\\f\H d \\<i 

As long as we restrict ourselves to the class A = A al1 = H* d of continuous linear functionals the 
complexity of this problem is completely determined by the spectrum of the self-adjoint and 
positive semi-definite operator W d := App^App^ Due to the compactness of App d , W d is also 
compact such that it possesses a countable set of eigenpairs {(A d,m,Vd,m) \m £ N}, 

1 ^dVd.m — ^d,mVd,m- 

1 Note that non-linear and/or adaptive algorithms don’t give an advantage in the setting we are going to 

describe (see [6, Chapter 5] for details). Hence, our choice is w.l.o.g. 
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The eigenvalues A d,m are non-negative real numbers which form a null-sequence that is (w.l.o.g.) 
ordered in a non-increasing way. For the ease of presentation we restrict ourselves to the case 
where Hd is separable, dimiJ^ = oo, and A d ,m > 0 for all m £ N. Then the set of eigenfunctions 
{rjd,m | Tri £ N} forms an orthonormal basis of Hd- Furthermore, they are also orthogonal w.r.t. 
the inner product of L^dO, l] d ) and we have A d,m = \\Vd,m | L 2 \\ 2 for any m £ N. The optimal 
algorithm A* d n in the mentioned (worst case ^-approximation) setting is given by the choice 


Bi — ( * , T]d,i) 


( ,Vd,z) L 

--- and ai = rj d ,i, 

*d,i 


i = 1 ,..., n. 


(25) 


Finally, its worst case error equals the nth minimal worst case error which can be calculated 
exactly: 

e wor (n; App d , A al1 ) = e wor {A* dn ; App d ) = i/\ d , n+ i, n £ N 0 . 

We refer to [6] for a more detailed discussion. 

All the statements made so far are true for arbitrary Hilbert spaces Hd which are compactly 
embedded into L2([0,l] d ). Taking into account the reproducing kernel property and using the 
definition of the adjoint operator App^, it can be checked easily that Wd takes the form of an 
integral operator, 

W d f = f f(y) K(-,y) dy, f £ H d , 

J [0,1] d 


since for any f £ Hd and all x £ [0, l] d 

(AppfApp d /)(a;) = (App^App d f,K{-,x)\ = (App d /, App d A:(-,a:)) i2 = {f,K{-,x)) L 
= ( W d f)(x ). 


Among other useful properties, we also have that if M 24 as defined in (23) is finite, then 


M 2 ,d = 11%.™ I L z\\ 2 = ' Kd ’ m =: trace (Wd)i 

m£N mGN 


see, e.g., [7, Section 10.8]. In particular, this finite trace property immediately implies that 
A d ,m £ ^(m -1 ) or e wor (n; App d , A al1 ) € £>(n -1 / 2 ), respectively. 

In addition, M 2 ,d is also related to the average case approximation setting. To this end, 
assume that B d is some separable Banach space of real-valued functions on [0, l] d equipped with 
a zero-mean Gaussian measure /id such that its correlation operator C Md : B d Bd applied to 
point evaluation functionals L x can be expressed in terms of the reproducing kernel K of H d - 
That is, we choose B d and such that 


K(x,y) = L x (C IJ , d Ly)= [ f(x)f{y) dy d {f) for all x,y£[ 0, l] d , 

JB d 


where L x : B d —> R with / 1 —> L x (f) := f(x) for x £ [0, l] d . We stress that these assumptions 
imply a continuous embedding of H d into Bd- For more details, the reader is referred to [6, 
Appendix B] and [7, Section 13.2]. 

Again, we look for good approximations Ad, n f to / in the norm of L2([0,l] d ). Thus, we 
formally approximate the operator 

App d : B d ->■ L 2 ([ 0, l] d ), / App d / := /, 
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by algorithms of the form (24). Clearly, we need to make sure that Bd is continuously embedded 
into L 2 ([0,l] d ). The difference to the worst case setting discussed above is that this time the 
error will be measured by some expectation with respect to fid- For this purpose, we define 


e avg (A d , n ;App d ) ■■= 


'B d 


App d f - A d ,nf 


d/J-dif)] 


1/2 


which immediately implies that the initial error is given by 
e a v g (0; App d , A) = e avg (A d , 0 ; App d ) 


(26) 


/ [ f(x) 2 dxdn d (f)\ =([ K[x, x) daA = M}J]. 

' B d ■/[0,l] d / \^[0,l] d / 


Since A d ,o = 0 does not use any information on /, this holds for information from both classes A 
in {A all ,A std }, where A std denotes the set of all function evaluation functionals and (as before) 

A all 

is the set of all continuous linear functionals. 

In general it can be shown that for A = A al1 the nth optimal algorithm A* d n is again given 

by (24) and (25). To see this, we define the Gaussian measure v d := ^ d ° (App rf ) _1 on L 2 ([0, l] d ). 
Then the corresponding covariance operator C Vd : T 2 ([0, l] d ) —> .C 2 ([0, l] d ) is given by 


C Vd f = 



dx, 


which formally equals the definition of W d above, see [3, Formula (16)]. Consequently, its eigen- 
pairs {(Ad, m , Vd,m) | m S N} are known to be the same as in the worst case setting and it can be 
shown that the nth minimal error satisfies 


e avg (n; App d , A al1 ) = e avg (A^„; App d ) 



for all n G Nq. 


5.2 Quadrature rules based on average case approximation algorithms 

We are ready to return to the integration problems, the main focus of this article. Given the 
space Bd as above, let us define 


Intd : Bd —^ ffi, /H>Int d /:= [ f(x) dx, 

J [o,i] d 

and let Int^ := Int^l^ denote its restriction to the RKHS H d C B d we are actually interested 
in. As before, we approximate this integral by some cubature rule Qd, n given by (4). Then the 
average case error of such an integration scheme Qd,n on Bd is defined by 


e avg (Qd,n;Int d ) 



1 t'W,/' Qd,nf 


2 


\ 1/2 

<W/)J 


Now [7, Corollary 13.1] shows that this quantity is exactly equal to the worst case (integration) 
error for Qd, n on Hd which is given by (5); i.e., 


e WOI (Qd,n\H d ) = e wor (Q d ,„;Int d ) = e avg (Q dirl ; Int d ). 


(27) 
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Keeping the latter relation in mind, our final goal in this section is to construct a suitable 
quadrature rule using the following procedure: Given any algorithm Ad t n that uses at most 
n £ No function values (information from the class A std ) to approximate App d , and a set V = 
{t^,... ,t^} ofreN points in [0, l] d , we let 


Qd 


, r 

,n+r(f) ■= Qd,n+r(f\ V£*, A d , n ) := / (A d , n f){x) dx + - V (f(t W ) - (A dj „ 

•Amp r iri v 


(28) 


Note that this algorithm Qd,n+r denotes a cubature rule 2 for Int^ (and Int^, respectively) which 
uses no more than n + r function evaluations. The basic idea behind this construction is a form 
of variance reduction and can already be found in [15]; for a clever choice of A dj n, the maximum 
“energy” of / is captured by the approximation A dt nf (which is assumed to be integrated exactly) 
such that the remaining part f — A d , n f can be treated efficiently by a simple QMC method using 
r additional nodes. Therefore, it is obvious that the corresponding integration error caused by 
Qd,n+r will highly depend on the approximation properties of the underlying algorithm A,] n and 
the quality of the chosen point set Vffl- 

A first step towards the desired error bound for Qd,n+r is given by the following estimate 
which resembles a bound given in [3, Formula (21)]. It states that, given Ad n - a suitable point 
set can always be found. For the sake of completeness, the (non-constructive) proof based 
on the proof of [15, Theorem 3] is included in the appendix. 

Proposition 5.1. Let d,r € N. Then for any algorithm A d , n , there exists a point set Vff) = 
such that Qd,n+r as defined in (28) satisfies 

e av6 (Q d ,n+ r ; £t d ) 2 < ~ e avg (A d , n ; A^p d ) 2 . (29) 

r 

Remark 5.2. Of course, the sets are not uniquely defined and it may be hard to find 

these sets in practice. On the other hand, we can argue that although these bounds are non¬ 
constructive, it is known that slightly larger bounds can be achieved with high probability by 
any random set of points, see, e.g., Plaskota et al. [11, Remark 2], and hence we claim that a 
suitable set can be found semi-constructively, provided A d?n is given. □ 

In view of Proposition 5.1 and our construction (28) we are left with finding suitable algo¬ 
rithms Ad yU (based on at most n function evaluations) which yield a small average case L 2 - 
approximation error. This can be done inductively. For this purpose, observe that for each 
to € N 




1 

m 


m 



where := for j e N, 

V *d ,j 


defines a probability density on ^([O, l] 2 ). Here the f d: j's denote the l^-normalized eigenfunc¬ 
tions r]dj of Wd and C Vd , respectively, as described in Section 5.1. Given to £ N, and an algorithm 
A ( j s that uses s £ No function values to approximate App d , and a set V ... ,t^} of 

2 Indeed Qd n +r is °f the form (4) provided that Ad n is linear. 
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q G N points in [0, l] d , let 


A d>s+q (f) = A dtS+q (f-,V d pp , A dtS ,m) 

= jr ( (A d , s f,z dJ ) L2 + - (A dtS f)( t W) 


Mm(* W ) y 


£d,,j > 


/es dl 

(30) 


define another .^-approximation algorithm on B d that uses s + g evaluations of /. Here we adopt 
the convention that 0/0 := 0. Without going into details we want to mention that A d ^ s+q basically 
approximates the mth optimal algorithm A* d m (w.r.t. A al1 ) for App d ; see [8, Section 24.3] for 
details. For A d)S+q defined this way, we have the following error bound which can be found in 
[8, Theorem 24.3]. 

Proposition 5.3. Let d : m, q € N be fixed. Then for any algorithm A d}S , there exists a point set 
V d pp = ^ > d P q * suc h that A d ' S+q as defined in (30) fulfills 

e avg {A d)S+q ; A PP(i ) 2 < e avg (m; App d , A al1 ) 2 + ^ e avg {A d ^ App d ) 2 . (31) 

Remark 5.4. Again the proof of Proposition 5.3 is non-constructive since it involves an averaging 
argument to find a suitable point set V d p q *. However, once more, such a point set can be found 
semi-constructively at the expense of a small additional constant in the error bound above. □ 

Hence, to construct an approximation algorithm A d:U (as required for our cubature rule (28)) 
inductively, in every step we need to choose positive integers m and q such that the right-hand 
side of (31) is minimized. Of course, this requires some knowledge about the mth minimal 
average case approximation error e avg (m; App d , A al1 ) which will be provided by the subsequent 
assumption. 

Assumption 5.5. For deN, there exist constants C d > 0 and p d > 0 such that for the ordered 
sequence of eigenvalues of W d = App d App d (see Section 5.1), the following holds 

__ 00 S~1 

e avg (m; App d , A al1 ) 2 = V X dj < - - for each me N 0 . (32) 

( m+ l) Pd 

We note in passing that the bound for m = 0 shows that C d needs to be larger than M 2id . 
On the other hand, in concrete examples C d will not be too large, as we will see in the remarks 
after Theorem 5.8 and in Section 5.3 below. 

Based on Proposition 5.3 and Assumption 5.5 we now construct a sequence (A d ^)fc g N 0 of algo¬ 
rithms which perform well for average case approximation App d on B d . Afterwards Proposition 5.1 
will be employed to derive the existence of suitable error bounds for the numerical integration 
on the RKHS H d . 

For p > 0, let u>{y) := y + 1 /y p denote a function on the real halfline y > 0. Then 
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shows that ui(y) > 1 and thus 2 p+1 oj(y) 1+1 ^ p > 2 for all y,p > 0. In particular this holds for the 
minimizer y p :=p 1 /(? , + 1 ) of u>. Now let 


JC P := max jfc e N 0 2 k < 2 p+1 w(i/p) 1+1//p j = log 2 (2 p+1 u](y p ) 1+1 / p ^ J £ N, p > 0, 


and let d £ N be fixed. Then our sequence of algorithms (A 


(fc) 

d 


)feeN 0 on B d is defined by 



0 

A , ( ■ -p a PP’* 
A d,2 k l ) r d,2 k ~! 


t(fc-l) 


,m k ) 


if k<IC Pd , 
if k > IC Pd , 


(35) 


where we set 

Cd2 k ^ 

e av g(j4^ fe) ; App d ) 2 

with pd and C d taken from Assumption 5.5 and A d 2 * as given by Proposition 5.3 for s = q — 2 fc_1 
The proof of the following lemma can be found in the appendix. 



i/(pd+i) 


y Pd 


k > fcpd i 


(36) 


Lemma 5.6. Let Assumption 5.5 be satisfied. Then the sequence (A^) fegNo given by (35) is 

(k) u 

well-defined. Moreover, every A y d , k £ No, uses at most 2 K points and satisfies the estimate 


e avg (4 fc) ;App d ) 2 < c(p d ) C d ( 2 k )~ (37) 
where c(p d ) := 2 Pd( ' Pd+1 ' 1 (1 + p d ) (1 + 1 /Pd) Pd > 1. 

Remark 5.7. Although the factor c(p d ) in Lemma 5.6 is super exponential in p d it might not be 
too large in concrete applications. E.g., in case of the Monte Carlo rate of convergence (p d = 1) 
it is easily shown that c(l) = 16. Moreover, in this case the first non-trivial algorithm from our 
sequence is given by A^ 1+1 ^ = A^ and uses no more than 16 sample points. For p d = 2, we 
would have c(2) = 432 and again = L^°§2(12v / 3)J = 4. However, remember that we need 
to impose stronger decay conditions in Assumption 5.5 in order to conclude a higher rate p d in 
Lemma 5.6. □ 

We are ready to define and analyze our final quadrature rule Qd,N for integration on H d . For 
this purpose, let d £ N, as well as N > 2, and suppose that Assumption 5.5 is satisfied. Setting 
k := |fog 2 ATJ — 1 £ Nq the rule Q d ,N is given by 


Qd,N 


Qd. 2 -(-;P^,4 k) ) 


(38) 


where the algorithm A^ = A d ^ n is taken out of the sequence (A^)fc e n 0 considered in Lemma 5.6 
and Qd, 2 K + 1 is the quadrature rule from Proposition 5.1 (with n = r = 2 K ). From the previous 
considerations it is clear that Q d ,N takes the form (4) with integration nodes t <J ' > £ [0, l] d from 
the set 

K, — 1 


Vd,N ■= v u 


u 


'P a pp,> 
' d,2 k 


k—K, v 


(39) 


which can be found semi-constructively. Moreover, the subsequent error bound can be deduced 
by a straightforward computation. 
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Theorem 5.8. Let d € N, as well as N > 2, and suppose that the RKHS Hd satisfies Assumption 5.5. 
Then the cubature rule Qd,N defined by (38) uses at most N integration nodes and satisfies the 
bound 

/ 1 \ P d 1 

e wor (Q d ,iv;^) 2 < 2^ d+2 ^ d+1 \l + Pd ) ^1 + -J Cd -jy—pi- 

Proof. Obviously, the set Vd,N defined in (39) satisfies ffVd,N < 2 K + J2k=o 2 fc = 2 K+1 . From 
2 K+1 < N < 2 K+2 it thus follows that ffVd,N < N. In addition, (27) together with (29) and (37) 
yields 

e wor (Q d ,iv;^) 2 - e avs (W +1 (- -^^A^yMtd) 2 

<^ eavs (4 K) ;App d ) 2 

< ^ c(p d ) C d ■ (2“)-w = 2 a <w+ 1 > c( Pd ) C d {2 J )Pd+1 , 
with c(pd) as defined in Lemma 5.6. ■ 


Finally, let us recall a standard estimate which ensures the validity of Assumption 5.5. For 
this purpose, suppose that Y^f= i EE is finite for some t > 1. Then for k £ N, the non-increasing 
ordering of (A d,j)jen yields 


k oo 

3 =1 3=1 

Consequently, for all to £ N, we obtain 


i.e., 


Ad,fc < k~ T ( ^2 E( 


1/t 


U =1 


E^<(Ea:ji Er<[E« T 


A~ t ? t ~ 1 

< 


j=m-\-1 


<3=1 


j=m -\-1 


<3=1 


T — 1 T — 1 


EE 


<3=1 


d ' j I (to + l ) 7 


The case to = 0 can be handled using Jensen’s inequality (Lemma A.l) and the fact that 2 v /y > 1 
for y > 0: 


OO 


e ^<i,j < 

3=1 



T 


< 


2 T—1 
T — 1 



Thus, (32) in Assumption 5.5 is satisfied with 

2 r_1 

= r - 1 and C d = -t 

T — 1 



for 


T > 1, 


whenever EE converges. 

Corollary 5.9. For d £ N, let Hd ^2([0, l] d ) denote a RKHS and assume that there exists 
t > 1 such that the ordered sequence of eigenvalues ofWd = App^App d satisfies EE < oo. 

Then for the cubature rule Qd,N considered above it holds 

j N~ T for all N > 2, 


( OO 

EEE 
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where c'(t) := 2 T ^ ^ ( 731 ) • 

Remark 5.10. Note that the factor c'(r) in the latter bound deteriorates, as t tends to one or 
to infinity. However, it can be seen numerically that there exists a range for r such that c'(t) 
is reasonably small. E.g., for r £ [1.003,2.04], we have c'(r) < 350. Moreover, observe that 
in general r might depend on d, whereas for the special case of d-fold tensor product spaces 
Hd = <S>( = i it can be chosen independent of d. On the other hand, in this case 

{Vi|jeN}=(nA t< 

U=1 

implies (J2jLi ^djY = Yj T ) Td > (X^i A k ) d which grows exponentially with the dimen¬ 

sion, provided that the underlying space H 1 is chosen such that /^-approximation is non-trivial 
and well-scaled (i.e., if e wor (0; App l5 A al1 ) 2 = 1 = Ai > A 2 > 0). □ 

5.3 Application to spaces of permutation-invariant functions 

We illustrate the assertions obtained in the preceding subsection by applying them to the repro¬ 
ducing kernel Hilbert spaces &i d (Fd(r a: p)) defined in Section 2. For this purpose we first need 
to determine the sequence of eigenvalues (A d,j)jeN of the d-variate operator W& = Wd(Kdj d ) 
given by 


k = (k 1 ,...,k d ) eN d 


W d f = (App^App d )(f) = [ f(x) K d ,i d (■, x) dx, 

d[0,l] d 

where App d : &i d (Fd(r a ,p)) —>■ L 2 ([0, l] d ) is the canonical embedding and Kdj d denotes the 
reproducing kernel of the /^-permutation-invariant space &i d {Fd(r at p)) given in (3). Afterwards, 
we can make use of Corollary 5.9 for all r > 1 for which YYj =1 -E/ is finite. 

If #/d < 2, then &i d (Fd(r at p)) = Fd(r a ,p) is the d-fold tensor product of Fi(r a ,p) and thus 
it suffices to find the univariate eigenvalues A k = M,k, k £ N, see Remark 5.10. From [6, p. 184] 
it follows that these eigenvalues are given by 

{\ k \k£N} = {r a ,p(h)~ 1 \h£Z} (40) 

Pi Pi Pi Pi Pi Pi 

’ i?(l) 2a ’ R{ 1) 2 “ ’ R( 2) 2 “ ’ R( 2) 2 “ ’ " ‘ ’ R(j) 2 “ ’ R(j) 2a 

i.e., we have one eigenvalue Po of multiplicity one and a sequence of distinct eigenvalues Pi / R(m ) 2a , 
m £ N, of multiplicity two. If we assume that Po > Pi/R(m) 2a for all m £ N, then the latter 
fist is ordered properly according to our needs. Consequently, 

OO OO 

E E T = pI /t +2 pI /t E w 2a/T , r > 1, 

k— 1 m =1 




is finite if and only if the latter sum converges. Due to our assumptions on R it is easily seen 
that 


C( 2 a/r) 



< C(2 a/r) 



(41) 
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where £ denotes Riemann’s zeta function. Therefore, Y^kLi A[/ r < oo if and only if 1 < r < 2a. 
In addition, from Remark 5.10 and (13) we infer that 


EE T 

G =1 



I ^ C d,r(r a ,/3) 


, h£Z d 


(note that M d(h)\ = 1 = #Sd for all h £ Z d since #/<j < 2) which shows that in this case the 
new error bound from Corollary 5.9 is worse than the estimate known from Proposition 4.1. 

If > 2, then &j d (F,((r a _p)) is a strict subspace of the tensor product space Fd(r a j 3). 
However, in [16] it has been shown that still there is a relation of the multivariate eigenvalues 
A d,j with the univariate sequence given in (40). In fact, for all 0 ^ Id = {ii, ■ ■ ■, i#i d } C {1,..., d} 
it holds 


{A d j| J6N} 




e=i 


k = € 


N d with k h < 


k‘i-2 E 


. . < k; 


#I d 


and hence the quantity of interest can be decomposed as 


OO 



1=1 


EE T 

,fc=l 


d-#I d 


#Id 


e iie 

fee# 1 -!, ^=1 

ki<...<k #Id 


1/r 


(42) 


Moreover, it can be shown that this expression is finite if and only if A \/ T < 00 which 

holds (as before) if and only if 1 < r < 2 a. 

In order to exploit the decomposition (42) for Corollary 5.9 we proceed similarly to the 
derivation of [10, Proposition 3.1]. That is, we bound the second factor with the help of some 
technical lemma (see Lemma A.2 in the appendix below) and obtain that for all U £ No it holds 


l 


#id 




fee n # j A £=1 

ki<...<k#i d 


*Id 




L ( A .(£) ' 

IP " 


1 / 1 


L—l 

2(n+i)<l[ i) <...<ii L) 


Ai 


< A f Id/T {#I d ) 2U 


#id 

2P+E 

L—0 


00 / \ \ 1/ T 

Aj 


E 

l=2(t/+l) 


(43) 


Given 1 < r < 2a let us define 

1/t 


1=2(17+1) 


°° / \ \ i/r / o \ !/r 00 

er(f/) : = E (^J E i?(m) ” 2 “ /T forever y 


Then the finiteness of (41) implies that there necessarily exists some U* := 
with 


0* = Qr(U*) < 1 . 


U;(R,a,(3,T) eN 0 
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In conjunction with (42) and (43) this finally yields that \ l J^ can be upper bounded by 


X (d~#I d )/T 


,fc=1 


E f A m 

l +T 


!/t 


i~#Id 


(2 u; + 


= < /T 


1 + 2 (|) E«<™r 2,,/ ' 


1/r 00 


-I d-#I d 


2u: 


1 - 0* 




2u: 


<e(0,d-,& ld (F d (r a ,p))) 2 ^ 


1 + 2 




i~#Id 


2U1 


1-0? 


(#/d) 


2U1 


since Xf = p$ = e(0 , d; &j d (F d (r a ^))) 2 . Thus, Corollary 5.9 implies the subsequent result. 

Theorem 5.11. Consider the integration problem on the Id-permutation-invariant subspaces 
&id(Fd{'ra,p)), where d > 2 and I d C {1,..., d} with #I d > 2. Moreover, assume that 


Pi 

Po R(m) 2a 


< 1 


for all m € N. 


(44) 


Then for every N >2, the worst case error of the cubature rule Q d ,N defined in (38) satisfies 


e^iQd^', &i d {F d {r a ,p ))) 


<e(0 ,d-,&i d (F d (r a ,p))) [2U; + 


1 - 0 ? 


r/2 


2 t(t 2 -1)/2 


1+2 


(_Pl 


J2a \ l / T 


\PoR(l) 2a 

for all 1 < t < 2a and Uf £ No such that 


C(2 a/r) 


T — 1 
(d-#I d )r/2 


r/2 


Qr = ‘ 2 \Pr 


Pi\ 


1/r 00 


£ R(rn)- 2 ^ < 1 . 

/ ^_ TT* I 1 


{#I d y u * N~ T/2 (45) 


(46) 


m—U * +1 


Remark 5.12. We conclude this section by some final remarks on Theorem 5.11. 

(i) First of all, note that using a sufficiently small (but constant) value of f3\ the condition (46) 
can always be fulfilled with Uf = 0. 

(ii) Observe that the bound (45) combines the advantages of the general existence result 
for QMC algorithms (Proposition 3.1) with the higher rates of convergence in the esti¬ 
mates for lattice rules (as well as their component-by-component construction) stated in 
Proposition 4.1 (and Theorem 4.5, respectively). In fact, (45) structurally resembles the 
bound (8) from Proposition 3.1; besides the initial error and some absolute constants it 
contains a term with grows exponentially in d — ffl d (but not in d itself!), as well as a 
polynomial in f)I d - On the other hand, similar to the lattice rule approach, the Monte 
Carlo rate of convergence is enhanced by a factor of r. Hence, the construction given in 
Section 5.2 provides a cubature rule which achieves a worst case error of 0(n~ a ) while 
the implied constants grow at most polynoinially with the dimension d , provided that we 
assume a sufficiently large amount of permutation-invariance (i.e., if d — ffl d £ O(lnd)). 
In other words, it can be used to deduce (strong) polynomial tractability. 
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(iii) We stress that (in contrast to the rank-1 lattice rules studied in Section 4) Qd,N is a general 
weighted cubature rule and its integration nodes do not necessarily belong to some regular 
structure such as an integration lattice. However, as exposed in Section 5.2, they can be 
found semi-constructively. 

(iv) Finally, we mention that the condition (44) improves on (7) by a factor of two. □ 


Appendix A 

In this final section we collect auxiliary results, as well as some technical proofs we skipped in 
the presentation above. 


A.l Auxiliary estimates 

For the reader’s convenience let us recall a standard estimate which is sometimes referred to as 
Jensen’s inequality. 


Lemma A.l. Let be an arbitrary sequence of non-negative real numbers. Then, for 

every 0 < q < p < oo, 

1 /p / \ 1/9 


£< 

V =1 



whenever the right-hand side is finite. 


In addition, in Section 5.3 we make use of the following technical lemma (with a m := Xl/, T 
for 77i € N and s := #Id) which was employed already in [10] and [16]. For a detailed proof (of 
an insignificantly modified version), we refer to [18, Lemma 5.8]. 


Lemma A.2. Let (cr m ) m£ N be a sequence of non-negative real numbers with <T\ = sup mgN tr m > 
0 and set <7L,k '■= Tlf=i a k e f or k 6 and L £ N. Then for all U £ No and every s £ N it holds 


( 


\ 


^ cr s ,k<crls 2U 




1 + 2 U + ^ ^ 1 


£ 




ke N s , 

/Cl <...<k s 


V 


l=i 

2 (u+i)<j[ L) <...<jL L) ) 


with equality at least for U = 0. 


A.2 Postponed proofs 

We start with showing Proposition 5.1 which relates the average case integration error of Qd,n+r 
with the error of the approximation algorithm Ad, n used to construct Qd,n+r, see Section 5.1. 

Proof of Proposition 5.1. For any fixed collection of points t^\ ..., t^ £ [0, l] d it is easy to see 
that 

_ _ 1 r 

Int d f ~ Qd,n+rf = Int rf (/ - A d ,nf) -^(/ - A dj „/)(t w ). 

r e=i 
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Squaring this expression and taking the expectation with respect to t^\ l = 1,..., r, gives 


[0,l] d x ■■■x [0,l] d 


Int df ~ Qd,n+rf di (1) ... di M = ^ (||/ - A dyn f \ L 2 1| 2 - (Int d (/ - A dyn f)) 2 ^j , 


which implies (by integrating over B dl interchanging the integrals, and estimating the negative 
term) that 


' [0,l] d x---x [0,l] d 


5 (Qd,n+r-;Intd ) 2 di (1) ... dt (r) < 


\\f~A d , n f\L 2 \\ 2 d p d (f). 


Now the integral on the right-hand side is nothing but e avs (A d ^ n ; App d ) 2 and by the usual 
argument (mean value theorem) there exists a point set V' d l’* = {t^\... ,t^} C [0, l] d such 
that e livg (Q din+r ( ■; P d n ^’*, A din ); Int d ) 2 is smaller than the average on the left. ■ 

It remains to prove Lemma 5.6 which justifies the iterative construction of the sequence of 
approximation algorithms (A^)/-eN 0 - 

Proof of Lemma 5.6. For this proof let p := p d and C := C d denote the constants from Assumption 5.5. 
We are going to prove that A^ 1 as defined in (35) satisfies 


e avg (4 fc) ■A PPd ) 2 <2^ p+1 ^(y p r +1 C-^ for all k G N 0 . (47) 

Then (33) together with the definition of y p :=p 1 /(P+ 1 ) implies the claim. 

Step 1. Let us consider k € {0,1,..., 1C P } first. Note that due to the considerations after 
formula (33) this set is not empty. Since for k < IC P we have 2 p(i5+1 ) u](y p ) p+1 (2 k )~ p > 1, it 
suffices to show that e avg (A^; App d ) 2 < C in this case. Since A^ = 0, this is true due to (26) 
and (32) applied for m = 0. The remaining assertions are trivial for k < 1C P . 

Step 2. Observe that m*, as defined in (36) is at least one if and only if 

e avg (A^; App d ) 2 < C2 k y p+1 . 

Moreover, the right-hand side of (47) is less than or equal to C 2 fc y p+1 if and only if 


2 p (l + J*) < 2 fe . (48) 

Therefore, every k £ No which satisfies (47) and (48) fulfills mk > 1. 

We stress that the conditions (47) and (48) hold true at least for k = JC p . In fact, the validity 
of (47) has been shown already in Step 1 and (by the definition of IC P ) we have that 

2 k p < 2 p+1 oj(y p ) 1+1/p < 2 Kp+1 , 

which implies 2 ICp > 2 P ui(y p ) 1+1 ^ p > 2 P (l + \/y p+1 ) using (34) for the last estimate. Furthermore, 
note that with k = KL P the condition (48) holds true for every k > KL p . 

Step 3. Now we prove (47) for A ( 'J Cp+1 \ For this purpose, we let k = K, p and make use of 
Proposition 5.3 with m = to*, and q = 2 k (here we need that to*, > 1). Employing the definition 
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of ruk given in (36) together with (32) from Assumption 5.5 we obtain 


e avg 


(A 


(fe+i). 


A PPd) 2 


< e avg (m fc ; App d , A al1 ) 2 + ^ e avg (A^ } ; App d ) 2 

^ C 1 /^ 1 ) e avg (A^ fc) ; App d ) 2p Ap+ 1 ) c-i/(p+i) e av g ^« ; ^pj2 P /(p+i) 

” 2 fc P/(P +1 )?/^ 2 fc P/(P +1 ) yp 


= ^i/Ip-h) 


5 avg (Ai fc) ;A PPd ) 2 


( 2 fc) P 


p/(p+i) 


2Mi/ P ) 

(2 fe+1 )P ' 


Finally, we plug in the upper estimate (47) for the error of A^' 1 and derive the same bound for 

A ^ d +1 ) = A^ p+1 \ But now Step 2 yields that also to^: p +i > 1 such that the same reasoning as 
before implies (47) for k = )C P + 2 as well, and (by induction) for any further k. 

In conclusion these arguments show that the tail sequence (A^)k>K p is well-defined and 
that it satisfies the claimed error bound. Since in the construction we add at most 2 fc_1 points 
when turning from k — 1 to k, each operator A^ obviously uses no more than 2 fc sample points. 
Hence, the proof is complete. ■ 
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